Search results for "Parallel process"

showing 10 items of 34 documents

CUSHAW2-GPU: Empowering Faster Gapped Short-Read Alignment Using GPU Computing

2014

We present CUSHAW2-GPU to accelerate the CUSHAW2 algorithm using compute unified device architecture (CUDA)-enabled GPUs. Two critical GPU computing techniques, namely intertask hybrid CPU-GPU parallelism and tile-based Smith-Waterman map backtracking using CUDA, are investigated to facilitate fast alignments. By aligning both simulated and real reads to the human genome, our aligner yields comparable or better performance compared to BWA-SW, Bowtie2, and GEM. Furthermore, CUSHAW2-GPU with a Tesla K20c GPU achieves significant speedups over the multithreaded CUSHAW2, BWA-SW, Bowtie2, and GEM on the 12 cores of a high-end CPU for both single-end and paired-end alignment.

BacktrackingComputer scienceParallel computingSoftware_PROGRAMMINGTECHNIQUESShort readComputational scienceCUDAParallel processing (DSP implementation)Hardware and ArchitectureParallelism (grammar)Electrical and Electronic EngineeringGeneral-purpose computing on graphics processing unitsSoftwareComputingMethodologies_COMPUTERGRAPHICSIEEE Design & Test

researchProduct

Exploring parallel capabilities of an innovative numerical method for recovering image velocity vectors field

2010

In this paper an efficient method devoted to estimate the velocity vectors field is investigated. The method is based on a quasi-interpolant operator and involves a large amount of computation. The operations characterizing the computational scheme are ideal for parallel processing because they are local, regular and repetitive. Therefore, the spatial parallelism of the process is studied to rapidly proceed in the computation on distributed multiprocessor systems. The process has shown to be synchronous, with good task balancing and requiring a small amount of data transfer.

ComputationNumerical analysisProcess (computing)MultiprocessingField (computer science)Computational scienceComputer Science ApplicationsSettore MAT/08 - Analisi NumericaOperator (computer programming)Parallel processing (DSP implementation)Modeling and SimulationModelling and SimulationImage velocity vectors field Quasi-interpolant operator B-spline functions Distributed multiprocessor systemsAlgorithmMathematicsData transmissionMathematical and Computer Modelling

researchProduct

Practical considerations for acoustic source localization in the IoT era: Platforms, energy efficiency, and performance

2019

The rapid development of the Internet of Things (IoT) has posed important changes in the way emerging acoustic signal processing applications are conceived. While traditional acoustic processing applications have been developed taking into account high-throughput computing platforms equipped with expensive multichannel audio interfaces, the IoT paradigm is demanding the use of more flexible and energy-efficient systems. In this context, algorithms for source localization and ranging in wireless acoustic sensor networks can be considered an enabling technology for many IoT-based environments, including security, industrial, and health-care applications. This paper is aimed at evaluating impo…

Computer Networks and CommunicationsComputer scienceDistributed computingContext (language use)02 engineering and technologyParallel architectures0202 electrical engineering electronic engineering information engineeringParallel processingWirelessSignal processingMulti-core processorHeterogeneous (hybrid) systemsbusiness.industry020206 networking & telecommunicationsAcoustic source localizationWireless acoustic sensor networks (WASNs)Computer Science ApplicationsEnergy efficiencyHardware and ArchitectureSignal Processing020201 artificial intelligence & image processingElectrónicabusinessWireless sensor networkSource localizationInformation SystemsEfficient energy useAcoustic signal processing

researchProduct

Application based on dynamic reconfiguration of field-programmable gate arrays: JPEG 2000 arithmetic decoder

2005

This paper describes the implementation of a part of the JPEG 2000 algorithm (MQ decoder and arithmetic decoder) on a field-programmable gate array (FPGA) board by using dynamic reconfiguration. A comparison between static and dynamic reconfiguration is presented, and new analysis criteria (spatiotemporal efficiency, logic cost, and performance time) have been defined. The MQ decoder and arithmetic decoder are attractive for dynamic reconfiguration implementation in applications without parallel processing. This implementation is done on an architecture designed to study the dynamic reconfiguration of FPGAs: the ARDOISE architecture. The obtained implementation, based on four partial config…

Computer scienceGeneral EngineeringControl reconfigurationcomputer.file_formatAtomic and Molecular Physics and OpticsParallel processing (DSP implementation)Gate arrayJPEG 2000System on a chipHardware_ARITHMETICANDLOGICSTRUCTURESArithmeticField-programmable gate arraycomputerImage compressionOptical Engineering

researchProduct

Parallel laser micromachining based on diffractive optical elements with dispersion compensated femtosecond pulses

2013

We experimentally demonstrate multi-beam high spatial resolution laser micromachining with femtosecond pulses. The effects of chromatic aberrations as well as pulse stretching on the material processed due to diffraction were significantly mitigated by using a suited dispersion compensated module (DCM). This permits to increase the area of processing in a factor 3 in comparison with a conventional setup. Specifically, 52 blind holes have been drilled simultaneously onto a stainless steel sample with a 30 fs laser pulse in a parallel processing configuration.

DiffractionFemtosecond pulse shapingMaterials scienceChromatic aberrationElectromagnetic pulseDiffraction efficiencyEngineering controlled termsUltrashort pulseslaw.inventionOpticslawLaser micro-machiningChromatic aberrationParallel processingDispersionsElectromagnetic pulseHigh spatial resolutionbusiness.industryEngineering main headingLaserBlind holesAtomic and Molecular Physics and OpticsAberrationsPulse stretchingParallel processing (DSP implementation)Fs laser pulseFemtosecondbusinessOptics Express

researchProduct

Accelerating short read mapping on an FPGA (abstract only)

2012

The explosive growth of short read datasets produced by high throughput DNA sequencing technologies poses a challenge to the mapping of short reads to a reference genome in terms of sensitivity and execution speed. Existing methods often use a restrictive error model for computing the alignments to improve speed, whereas more flexible error models are generally too slow for large-scale applications. Although a number of short read mapping software tools have been proposed, designs based on hardware are relatively rare. In this paper, we present a hybrid system for short read mapping utilizing both software and field programmable gate array (FPGA)-based hardware. The compute intensive semi-g…

Dynamic programmingSpeedupSoftwareParallel processing (DSP implementation)Computer sciencebusiness.industryHybrid systemSensitivity (control systems)Parallel computingShort readbusinessField-programmable gate arrayProceedings of the ACM/SIGDA international symposium on Field Programmable Gate Arrays

researchProduct

Study of the Audio Susceptibility in Parallel Power Processing With a High-Power Topology

2009

In this paper, the audio susceptibility characteristic of a high-efficiency nonisolated topology that processes only a part of the total power delivery is analyzed. Since the proposed topology presents a direct path from input to output, the effect of the input voltage ripple at the output voltage has been studied. The effect on the audio susceptibility of the values and disposition of the components and the effect of their parasitic elements must be taken into account. Due to this study, the analytical expression of the audio susceptibility and the design criteria to improve it have been obtained.

EngineeringParallel processing (DSP implementation)business.industryPower electronicsRippleElectronic engineeringTopology (electrical circuits)Power-flow studyElectrical and Electronic EngineeringbusinessExpression (mathematics)VoltagePower (physics)IEEE Transactions on Power Electronics

researchProduct

Parallel In-Memory Evaluation of Spatial Joins

2019

The spatial join is a popular operation in spatial database systems and its evaluation is a well-studied problem. As main memories become bigger and faster and commodity hardware supports parallel processing, there is a need to revamp classic join algorithms which have been designed for I/O-bound processing. In view of this, we study the in-memory and parallel evaluation of spatial joins, by re-designing a classic partitioning-based algorithm to consider alternative approaches for space partitioning. Our study shows that, compared to a straightforward implementation of the algorithm, our tuning can improve performance significantly. We also show how to select appropriate partitioning parame…

FOS: Computer and information sciencesComputer Science - DatabasesComputer Science - Distributed Parallel and Cluster ComputingParallel processing (DSP implementation)Computer scienceOrder (business)JoinsJoin (sigma algebra)Databases (cs.DB)Parallel computingDistributed Parallel and Cluster Computing (cs.DC)Computer Science::Databases

researchProduct

GekkoFS - A Temporary Distributed File System for HPC Applications

2018

We present GekkoFS, a temporary, highly-scalable burst buffer file system which has been specifically optimized for new access patterns of data-intensive High-Performance Computing (HPC) applications. The file system provides relaxed POSIX semantics, only offering features which are actually required by most (not all) applications. It is able to provide scalable I/O performance and reaches millions of metadata operations already for a small number of nodes, significantly outperforming the capabilities of general-purpose parallel file systems. The work has been funded by the German Research Foundation (DFG) through the ADA-FS project as part of the Priority Programme 1648. It is also support…

File system020203 distributed computingBurst buffersParallel processing (Electronic computers)Computer scienceProcessament en paral·lel (Ordinadors)020207 software engineering02 engineering and technologyBuffer storage (Computer science)computer.software_genreData structureDistributed file systemsMetadataParallel processing (DSP implementation)POSIXServerScalabilityHPC0202 electrical engineering electronic engineering information engineeringOperating systemHigh performance computingDistributed File System:Informàtica::Arquitectura de computadors::Arquitectures paral·leles [Àrees temàtiques de la UPC]computerCàlcul intensiu (Informàtica)2018 IEEE International Conference on Cluster Computing (CLUSTER)

researchProduct

The HPC Certification Forum: Toward a Globally Acknowledged HPC Certification

2020

The goal of the HPC Certification Forum is to categorize, define, and examine competencies expected from proficient HPC practitioners. The community-led forum is working toward establishing a globally acknowledged HPC certification process, a process that engages with HPC centers to identify gaps in users’ knowledge, and with users to identify the skills required to perform their tasks. In this article, we introduce the forum and summarize the progress made over the last two years. The release of the first officially supported certificate is planned for the second half of 2020.

General Computer ScienceComputer scienceProcess (engineering)05 social sciencesGeneral Engineering050301 educationCertificationCertificateSupercomputerEngineering managementParallel processing (DSP implementation)Task analysis0501 psychology and cognitive sciences0503 education050104 developmental & child psychologyComputing in Science & Engineering

researchProduct